Corpus-based modeling of naturalness es non-native spee
نویسندگان
چکیده
In this paper, aiming at automatic estimation of naturalness in timing control of non-native’s speech, we have analyzed the timing characteristics of non-native’s speech to correlate with the corresponding subjective naturalness evaluation scores given by native speakers. Through statistical analyses using English speech data spoken by Japanese with temporal naturalness scores ranging one to five given by natives, we found high correlation between their scores and the differences from native’s speech. These analyses provided a linear regression model where naturalness in timing control is estimated by differences from native’s speech in durations of overall sentences, individual content and function words and pauses. The proposed naturalness evaluation model was tested its estimation accuracy using open data. The root mean square errors 0.64 between scores predicted by the model and those given by the natives turned out to be comparable to the differences 0.85 of scores among native listeners. Good correlation between model prediction and native’s judgments confirmed the appropriateness of the proposed model.
منابع مشابه
Dissertation Summary Recognizing Non-native Speech: Characterizing and Adapting to Non-native Usage in Lvcsr
Low-pro ien y non-native speakers represent a signi ant hallenge for large-vo abulary ontinuous spee h re ognition (LVCSR). A ousti models are onfused by a heavy a ent; language models are onfused by poor grammar and un onventional word hoi e. La k of omfort with the spoken language a e ts the fundamental properties of onne ted spee h that have been a fo us of LVCSR resear h; ross-word and inte...
متن کاملAnalysis of the phone level contributions to objective evaluation of English speech by non-natives
Aiming at automatic estimation of naturalness in timing control of non-native’s speech, we have analyzed the timing characteristics of non-native’s speech to correlate with corresponding subjective naturalness evaluation scores given by native speakers. In addition to word level statistical characteristics showing the differences between natives and nonnatives, we analyzed phone and syllable le...
متن کاملThe Use of Lexical Bundles in Native and Non-native Post-graduate Writing: The Case of Applied Linguistics MA Theses
Connor et al. (2008) mention “specifying textual requirements of genres” (p.12) as one of the reasons which have motivated researchers in the analysis of writing. Members of each genre should be able to produce and retrieve these textual requirements appropriately to be considered communicatively proficient. One of the textual requirements of genres is regularities of specific forms and content...
متن کاملHedges and Boosters in Academic Writing: Native vs. Non-Native Research Articles in Applied Linguistics and Engineering
The expression of doubt and certainty is crucial in academic writing where the authors have to distinguish opinion from fact and evaluate their assertions in acceptable and persuasive ways. Hedges and boosters are two strategies used for this purpose. Despite their importance in academic writing, we know little about how they are used in different disciplines and genres and how foreign language...
متن کاملA Corpus-based Analysis of Epistemic Stance Adverbs in Essays Written by Native English Speakers and Iranian EFL Learners
Academic essays entail taking a stance on the truth value of propositions. Epistemic adverbs deal with the speaker's assessment of the truth value of propositions. Employing a corpus-based approach with descriptive statistics and qualitative description, this study explored the use of epistemic stance adverbs in academic essays written by native English speakers and Iranian EFL learners. Follow...
متن کامل